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An On-Line Shopping Conversion Simulation Module 
Technical Field 

The present invention relates to the field of modeling and simulations. 
5 More specifically, the present invention pertains to an apparatus and method 
for modeling and simulating the conversion beviour of on-line shoppers. 

Background Art 

1 0 With the advent of the Internet, people can log on and shop on-line 

from the convenience of their home. Rather than physically driving to a 
store, hunting for merchandise, waiting in line to purchase the item, lugging 
bags around, and then driving back home, the Internet now enables 
shoppers to simply browse the websites of any one of a multitude of on-line 

1 5 retailers offering products for sale to the public. Consumers can browse the 
web pages of different on-line retailers to find the particular products they 
desire, shop for best price, determine the availability and features of the items 
of interest, and ultimately pay with a credit card. 

20 The on-line shopping experience is enjoying great popularity due to 

the ease and convenience by which people can access web sites and peruse 
the merchandise being offered. However, in order to be successful, the on- 
line retailer must convert the browsing public into active purchasers. Not 
only are on-line retailers faced with the task of getting potential shoppers to 

25 click on and visit their website, but the on-line retailers must then efficiently 
convert these would-be shoppers into buying their merchandise. 
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On-line retailers have several mechanisms by which they can entice 
browsers into actually buying their products. For instance, on-line retailers 
can offer promotions such as sales, buy-one-get-one-free, donating a portion 
of the sale to a customer's favorite charity, extended warranties, frequent- 
5 buyer programs, upgrades, financing packages, etc. However, the more 

promotions lavished into converting potential customers directly cuts into the 
retailer's profits. There must be some balance between the degree of 
promotions and the chance of converting a shopper into a buyer. 

1 0 One way in which to determine this delicate balance entails modeling 

the shopping behavior of on-line shoppers. In theory, the model would 
reliably predict the percent chance of converting an on-line shopper given a 
selected set of promotions. By using such a model, on-line retailers could 
adjust their promotions to maximize profits while minimizing the costs 

1 5 associated promotional costs. Moreover, on-line retailers could use this 
model to customize their web offerings and even offer specific sets of 
promotions tailored to individual potential customers visiting their sites. 
Indeed, intelligent software could use the model to automatically customize 
promotions to a particular visitor's known preferences, past history, 

20 demographics, ethnicity, etc. 

Thus, there exists a need for an accurate model for forecasting a on- 
line shopper's behavior. 
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DISCLOSURE OF THE INVENTION 

The present invention relates to an on-line shopping conversion 
simulation module. This on-line shopping conversion simulation module is 
5 used for predicting the chance of an on-line shopper being converted into 
becoming an actual purchaser of an item based on promotions offered by an 
on-line vendor. Sets of data including on-line customers' profile information; 
customer log information; product information corresponding to a plurality 
of products offered for sale by the on-line vendor; and promotion attributes 

1 0 corresponding to the plurality of products are stored in a database. Next, a 
model which simulates shopping behavior as a function of the customer 
profile information, customer log information, product information, and 
promotion attributes is constructed. This model is partially based on the 
traditional logistical regression theory and partially on the maximum utility 

1 5 theories. Thereby, the data corresponding to a new on-line shopper is input 
to the model which then compute a percentage likelihood that the shopper is 
converted into becoming a purchaser. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

The accompanying drawings, which are incorporated in and form a 
part of this specification, illustrate embodiments of the invention and, 
together with the description, serve to explain the principles of the invention: 

5 

Figure 1 shows a block diagram of an on-line shopping conversion 
simulation module according to the currently preferred embodiment of the 
present invention. 

1 0 Figure 2 shows the processes related to the currently preferred 

embodiment of the on-line shopping conversion simulation module. 

Figure 3 shows a generic personal computer upon which the present 
invention may be practiced. 

15 
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BEST MODE FOR CARRYING OUT THE INVENTION 

An apparatus and method for an on-line shopping conversion 
simulation module is described. In the following detailed description of the 
5 present invention, numerous specific details are set forth in order to provide 
a thorough understanding of the present invention. However, it will be 
obvious to one skilled in the art that the present invention may be practiced 
without these specific details or by using alternate elements or methods. In 
other instances well known methods, procedures, components, and circuits 
1 0 have not been described in detail as not to unnecessarily obscure aspects of 
the present invention. 

An on-line visitor to a shopping site may or may not be converted into 
a customer, that is, a visitor may or may not buy a promotion product. If the 
1 5 visitor does buy, the purchase quantity is also unknown to us beforehand. 
The most critical element to the success for any on-line and off-line shopping 
site is a deep understanding of what factors are relevant to, and how they are 
correlated to the conversion process. To this end, the present invention of an 
on-line shopping conversion simulation module has been developed. 

20 

In the currently preferred embodiment, the on-line shopping 
conversion simulation module of the present invention comprises of two 
components. One is the simulator that the users can simulate the conversion 
process, including the conversion status, and if converted, the purchase 
25 quantity, cost and revenue. The other component is the modeling part, 

which hides behind the users, but is the core machine that addresses exactly 
the aforementioned issues regarding the shopping behavior of customers. 

I 
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Referring to Figure 1, data regarding shoppers is stored in a database 101. 
This data is fed into the conversion simulation process 102. A modeling 
engine 103 is then used to generate the probability of converting a particular 
shopper into a purchaser. The resulting probability is displayed and stored as 
5 a percentage 104. 

Essentially, the simulator 102 and model 103 relate a customer's profile 
information and a product promotions' effects to the customer's conversion 
probability. The modeling part, partially based on the traditional logistical 

1 0 regression theory, and partially on the maximum utility theories developed 
by Nobel Prize winner Daniel McFadden, addresses the question why 
customers do or do not get converted from a visitor's status to a customer's 
status. The simulation part simulates the conversion behavior based on the 
model. This module can greatly facilitate the development of other analytic 

1 5 CRM (Customer Relationship Management) components, and serve as 
testing bed of any analytic CRM systems, as demonstrated in the eMO (e- 
Market Optimization) development. For example, without this module, the 
optimization module in eMO simply won't have any opportunity to be tested 
and refined before any real world application. 

20 

Figure 2 shows the processes related to the currently preferred 
embodiment of the on-line shopping conversion simulation module. Initially, 
a database containing on-line shoppers information is created and 
maintained, 201. This on-line shoppers information includes information 
25 regarding the profiles of various customers, 202. The profile information 
contains extensive information regarding a particular shopper, such as the 
shopper's age, sex, religion, income, ethnicity, marital status, geographical 
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location, number of children, interests, hobbies, spending habits, etc. Any 
information which may characterize a shopper is beneficial to be included in 
the customer profile information. Initially, these customer profiles can be 
purchased or accumulated and updated over time. 

5 

The on-line shoppers information also includes information relating to 
customers' web log information, 203. This log information contains data 
regarding when the customer accessed the web site, how long the customer 
visited the web site, which items were of interest, how the customer heard 

1 0 about the web site, whether the customer saw the promotion(s), whether the 
customer was motivated to taking action as a result of the promotion(s), 
whether the customer inspected an item, whether the customer put the item 
back, whether the customer bought an item, the quantity of items purchased, 
etc. The log information basically contains a historical account of a shopper's 

1 5 actions from the moment the shopper first enters the web site to when that 
shopper leaves the web site. This information is collected and stored with 
each visitor to the vendor's web site. 

The third set of data relating to the on-line shoppers information 
2 0 characterizes product and promotion attributes, 204. Each product is 

different from other products offered for sale on-line. As such, each product 
has its dedicated set of attributes. These attributes describe that particular 
item for sale and may include its price, color, make, model, manufacturer, 
size, weight, availability, features, functionalities, etc. Also included are 
25 promotions (if any) corresponding to each of the products offered for sale. 

These promotions are used to entice shoppers to purchasing a particular item. 
Promotions can include sales, upgrades, extended warranties, buy-one-get- 
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one free, financing packages, free options, rebates, coupons, donations to 
charities, free gifts, etc. These product and promotion attributes are known 
and set by the on-line vendor. The vendor may selectively vary one or more 
of these promotion attributes, depending on a particular shopper, a particular 
5 item, or a combination thereof. The manner by which the promotion 

attributes are set may be a function of the results generated by the on-line 
shopping conversion simulator. 

The next process involves building the on-line shopping conversion 

1 0 model and simulator, 205. In the currently preferred embodiment, the model 
and the simulator relate a customer's profile information and an on-line 
product promotion's effects to the customer's conversion probability. The 
modeling part, partially based on the traditional logistical regression theory, 
and partially on the maximum utility theories developed by this year's Nobel 

1 5 Prize winner, Daniel McFadden, addresses the question of why customers do 
or do not get converted from a visitor's status to a customer's status. The 
simulation part simulates the conversion behavior based on the model. This 
module can greatly facilitate the development of other analytic CRM 
components, and serve as testing bed of any analytic CRM systems, as 

20 demonstrated in the eMO development. For example, without this module, 
the optimization module in eMO simply will not have any opportunity to be 
tested and refined before any real world application. As part of the model 
building, a list of variables relating to model need to be first identified and 
selected, step 206. Furthermore, one or more key parameters must be 

25 estimated, step 207. 
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Based on the model constructed in process 205, one can predict the 
likelihood that a particular shopper will be converted into a purchaser, 208. 
When a new customer visits the web site, the chances for converting the 
customer into a buyer is calculated according to the initial training customer 
5 data set, 209. The model created in 205 then generates the chances of 

conversion. The customer's actual log information is collected and this new 
information is retained and fed back as relevant information. Based on this 
new information, the variable identification and selection process can be 
refined. Furthermore, a better estimation of the parameter(s) can be 
1 0 calculated. Thereby, the model can be continuously updated and improved 
upon with each new actual customer information being input to the overall 
process. 

In the currently preferred embodiment, the promotions and customer 

1 5 segments need to be defined. A promotion is defined as a set of attributes. 

For example, it can consist of the following: discount rate, free shipping & 
handling, rebate, special event promotional discount. For a customer 
segment, it is also defined as a set of attributes. For example, it can consist of 
the following: average time on site, purchased-on-line-before probability, 
20 product market saturation rate. Any individual customer from a segment is a 
stochastic realization from a model with the "mean value", which is specified 
by the mean attributes. For each attribute, there can be multiple levels. The 
following is a sample specification of Promotion and Class (Segment). It is an 
input to the SIM1, the simulator function that is implemented in S-Plus (a 

2 5 statistical computing language). 

list("Promotion" = list("special.prod.disc.rate.all" = c(5., 20., 40.) 
, "special.refsite.disc.rate.all" = c(5., 40.) 



1 0 

) 

, "Class" = list("seconds.on.site.lambda.all" = c(10., 60., 120.) 

, "purchased.online.bf.p.all" = 0(0.01, 0.4) 

) 

) 



For a product, one can introduce the notion of a multi-attribute. The 
simulation can be expanded to include that generalized promotion definition. 

1 0 Next, the simulation of a fixed combination of promotion and Segment 

is defined as a SIM1 function. For example: 

SPlus> SIM1(N = 1000., Customer.Group.ID = "CI", 
Promotion.ID = "PI", Spec =Spec.sdat) 
The SIM1 function simulates the on-line shopping activities, and the returned 

1 5 process value is to be used as a proxy for real on-line shopping data. There 
are many underlying patterns or customer behaviors that can be built based 
on econometric modeling experiences. Suppose a customer visits a shopping 
site. There are many products advertised on the site. For any single product, 
with the on line sales price information, and customer's knowledge, the 

20 customer can deduce the DiscountPercentageToOffline. The customer will 
then decide if he or she will get a good offer based on the utilities derived 
from all the information. The customer may or may not see the ad, and may 
or may not select the product, and may or may not finally buy it. For each 
corresponding step, a (conditional) logistic regression model is used to 

25 generate the process. The choice of logistic regression model is based on the 
maximum utility theory developed in the demand model. For example, one 
can model: 
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p(^ = i|s £fert = i)=-SE^- 



l + exp(^ X)' 



5 where X is a vector of vector: X = (X a ;X 2 ;X 3 ), X 1 is the customer profile 

information vector, X 2 is the promotion attribute vector, and X 3 is the product 
attribute vector. The following distributional property has been used: If Xj ~ 
B(l; p 1 ); X2 ~ B(l; p 21 ), independently distributed, then Y = XjX 2 ~ B(l; p 2 ), 
where p 2 =p 1 p 21 . 

10 

If the customer buys a product, the customer may buy more than one 
item. For the quantity sold, it is modeled to be statistically "proportional" to 
the discount effect. Specifically, the value takes a Poisson distribution, with 
the distribution mean proportional to the discount effect. However, there is 

1 5 upper limit for the purposes of inventory safety and of customer attraction 
distribution. In the currently preferred embodiment, a truncated Poisson 
Distribution is used so that the returned value is at least 1 and no greater than 
K. It should be noted that there is a holiday effect on people's buying 
behavior. Some holidays produce a positive effect, such as in Christmas, and 

20 some produce a negative effect, such as spring break or summer hot days. 

The Wealth Effect Index: zip+4 code and house/ apt ownership 
indication on shipping address. From the zip code, one can get the average 
house information: average income, average house member number. One 
2 5 can also can get a relationship between income and house ownership. The 
Zip code could also provide market saturation rate, which should also be a 
factor in the conversion model. In this particular implementation, customer 
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class (characteristics, a multi-attribute vector) variables are created and 
controlled (so that one can have a perfect segmentation): 

1. seconds.on.site.lambda 
5 2. purchased.online.before.probability 

Also, the promotion variables are created that are used to control: 

1. special.prod.disc.rate 
10 2. special.refsite.disc.rate 

The following is an example of the output from SIM1. SPlus> SIM1(5) 

Promotion.ID Customer. Group .ID Product.ID Ad.On.Days Holiday.Eff 

15 

1 PI CI D6 24 0 

2 PI CI D7 41 0 

3 PI CI D10 33 0 

4 PI CI D4 25 0 
20 5 PI CI D2 34 0 

Discount.Percentage Seconds .On.Site Refsite.URL ZipCode 

1 6 11 hr.org 70151 

2 6 10 hd.com 30164 
25 3 7 13 tk.com 20177 

4 2 11 zv.com 30180 

5 1 13 qg.com 70103 
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House.Ownership .Indicator Purchased.Online.Bf Did.See Did.Select 



1 1 


0 




1 


0 


2 0 


0 




1 


1 


3 1 


0 




1 


1 


4 0 


0 




1 


1 


5 0 


0 




1 


1 


Conversion.Prob Did.Buy 


Bought.Qty Revenue Discount.Loss Matl.Cost 


1 0.3371550 


0 


0 


0.00 


0.0000 0 


2 0.3338475 


1 


3 


1979.35 


120.6462 1700 


3 0.3665212 


0 


0 


0.00 


0.0000 0 


4 0.2784663 


0 


0 


0.00 


0.0000 0 


5 0.2800296 


0 


0 


0.00 


0.0000 0 



15 

Fixed.Cost Profit.a Profit.b 



1 


0 


0.00 0.00 


2 


0 


279.35 158.71 


3 


0 


0.00 0.00 


4 


0 


0.00 0.00 


5 


0 


0.00 0.00 



The on-line shopping conversion simulation module can also perform 
a simulation of several combinations of promotions and segments. Based on 
25 SIM1, SIM2 can simulate for (arbitrarily) several combinations of promotions 
and segments. For example: 
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SPlus> SIM2(N = l v data.dir = "Testing0809", Spec.file = "Spec.list") 

It should be noted that this is a full factorial design, which has no problem in 
affording in the simulation world. However, in the real world, for any single 
5 test of each combination, the cost is usually quite significant. Consequently, 
one has to consider the fractional factorial design. The output from SIM2 is 
similar to that from SIM1, except the output results are from the selected 
combinations of the controllable variables. 

1 0 With the output from this all combination simulation, the data is sent 

for the optimization engine to optimize over the space of segment and 
promotion, using different objective functions. Out of the OPT (Optimization 
program) are the optimization plans that are evaluated in the next step. The 
following example plan was derived from OPT, with the objective function 

1 5 being the conversion rate, and using the estimated conversion rate from the 
training data set: 

convest 
P6 1 CI 1 1.00 
20 P6IC2I1.00 
P61C3I1.00 
P61C4I1.00 
P6 1 C5 1 1.00 
P61C6I1.00 

25 

Using simple statistical analysis, statistically-driven" optimal plans can 
be derived, which are then used to compare with and test the OPT derived 
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plans. Specifically, for each combination of promotion and segment (Pi ; Cj), 
the average value of "performance metrics", including the conversion rate, 
cost, revenue, and profit are computed. Then for any given objective 
function, for example, the conversion rate, the first six combinations that 
5 have the largest values are selected. The reason to use six combinations is to 
maintain the same total customer base, since for all combination, one would 
allocate to the same number of customers. The following is a sample 
comparison report. 

1 0 SPlus> Eval.s(N=2000, data.dir = "Testing0801") 



Conversion.Rate Gross.Rev Discount.Loss Matl.Cost Rev Profit 



convest 


0.4233 


313.8 


216.7 


432.9 


-119.0 


-335.7 


convrdm 


0.3992 


373.7 


116.5 


400.1 


-26.5 


-143.0 


convrdmOl 


0.3692 


369.9 


72.6 


361.4 


8.5 


-64.2 


convrdm90 


0.4067 


349.4 


136.8 


397.1 


-47.7 


-184.5 


gpest 


0.3550 


386.8 


35.2 


344.7 


42.1 


7.0 


gprdm 


0.3550 


386.8 


35.2 


344.7 


42.1 


7.0 


gprdmOl 


0.3550 


386.8 


35.2 


344.7 


42.1 


7.0 


gprdm90 


0.3550 


386.8 


35.2 


344.7 


42.1 


7.0 


revest 


0.4058 


394.0 


104.3 


406.8 


-12.8 


-117.1 


revrdm 


0.3992 


373.7 


116.5 


400.1 


-26.5 


-143.0 


revrdmOl 


0.3758 


360.9 


81.5 


361.4 


-0.6 


-82.1 


revrdm90 


0.4108 


390.6 


112.5 


410.8 


-20.1 


-132.7 



25 

Figure 3 shows a generic personal computer upon which the present 
invention may be practiced. Computer system 301 of Figure 3 includes an 
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address /data bus 306 for communicating information, a central processor 302 
unit coupled with the bus 306 for processing information and instructions, a 
random access memory 304 coupled with the bus 306 for storing information and 
instructions for the central processor 302, a read only memory 303 coupled with 
5 the bus 306 for storing static information and instructions for the processor 302, a 
data storage device 305 (e.g., a magnetic or optical disk and disk drive) coupled 
with the bus 306 for storing information and instructions, a display device 307 
coupled to the bus 306 for displaying information to a computer user, an 
alphanumeric input device 308 including alphanumeric and function keys coupled 

10 to the bus 306 for communicating information and command selections to the 
central processor 302, a cursor control device 309 coupled to the bus for 
communicating user input information and command selections to the central 
processor 302, and a signal generating device 310 coupled to the bus 100 for 
communicating command selections to the processor 302. A copy of the on-line 

1 5 shopping conversion simulation module can be stored in data storage device 305. 
■ , along with the relevant data. Processor 302 processes the information and 
generates a percentage conversion rate for on-line shoppers. 

The preferred embodiment of the present invention, an on-line 
20 shopping conversion simulation module, is thus described. While the present 
invention has been described in particular embodiments, it should be ■ 
appreciated that the present invention should not be construed as limited by 
such embodiments, but rather construed according to the below claims. 



